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This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims 

1 . (Currently Amended) A method for handling redirects in documents, comprising: 
while generating an index, 

forming at least one equivalence class that includes documents that are connected through 
a redirec t, wherein each equivalence class describes a redirect chain ; 

detecting cycles for each equivalence class, wherein documents in a cycle are marked so 
that they are not indexed , and, wherein, for each equivalence class, the cycle is formed when a 
last document in the redirect chain redirects to a first document in the redirect chain ; 

detecting incomplete chains for each equivalence class, wherein documents in an 
incomplete chain arc marked so that they arc not indexed , and, wherein, for each equivalence 
class, the indirect chain is formed when a last document in the redirect chain redirects to a 
document that has not been crawled ; and 

selecting a representative for each equivalence class whose documents are to be indexed, 
wherein the representative is associated with a path that indicates a location of a document in a 
data store. 

2. (Original) The method of claim 1, wherein the representative is selected based on 
a type of redirect in an equivalence class. 

3. (Original) The method of claim 1, wherein the representative is selected based on 
a rank of each document in the equivalence class. 

4. (Original) The method of claim 1, further comprising: 
locating each document that contains a redirect; and 
creating an entry in a redirect file for each document. 

5. (Original) The method of claim 4, wherein the entry includes a source path, a 
target path, and a redirect type. 
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6. (Original) The method of claim 1, further comprising: 
detecting duplicate documents in two different equivalence classes; and 
merging the equivalence classes. 

7. (Original) The method of claim 6, wherein documents are duplicates if a certain 
portion of their content is similar. 

8. (Original) The method of claim 1, wherein the documents in the at least one 
equivalence class include a target document and one or more source documents and wherein the 
selected representative is one of the source documents, further comprising: 

propagating the content of the target document to the selected representative. 

9. (Original) The method of claim 1, wherein the documents in the at least one 
equivalence class include a target document and one or more source documents, and wherein at 
least one source document includes a path to the target document. 

10. (Original) The method of claim 9, further comprising: 

indexing the content of the target document with a path of the representative. 

1 1 . (Original) The method of claim 1 , wherein marking documents so that they are 
not indexed includes marking documents to indicate the documents are to be ignored. 

12. (Original) The method of claim 1, further comprising: 

determining a rank for each of the documents, wherein the rank represents an importance 
of each document relative to the other documents. 

13. (Currently Amended) An article of manufacture comprising one of hardware logic 
and a computer readable medium including a program for handling redirects in documents, 
wherein the hardware logic or program causes operations to be performed, the operations 
comprising: 



Page 3 of 12 



Amdt. dated February 13, 2007 Serial No. 10/764,771 

Reply to Office action of December 14, 2006 Docket No. SVL9200301 16US1 

Firm No. 0056.0018 

while generating an index, 

forming at least one equivalence class that includes documents that are connected through 
a redirec t, wherein each equivalence class describes a redirect chain ; 

detecting cycles for each equivalence class, wherein documents in a cycle are marked so 
that they are not indexed , and, wherein, for each equivalence class, the cycle is formed when a 
last document in the redirect chain redirects to a first document in the redirect chain ; 

detecting incomplete chains for each equivalence class, wherein documents in an 
incomplete chain are marked so that they are not indexed , and, wherein, for each equivalence 
class, the indirect chain is formed when a last document in the redirect chain redirects to a 
document that has not been crawled ; and 

selecting a representative for each equivalence class whose documents are to be indexed, 
wherein the representative is associated with a path that indicates a location of a document in a 
data store. 

14. (Original) The article of manufacture of claim 13, wherein the representative is 
selected based on a type of redirect in an equivalence class. 

15. (Original) The article of manufacture of claim 13, wherein the representative is 
selected based on a rank of each document in the equivalence class. 

16. (Original) The article of manufacture of claim 13, wherein the operations further 
comprise: 

locating each document that contains a redirect; and 
creating an entry in a redirect file for each document. 

17. (Original) The article of manufacture of claim 16, wherein the entry includes a 
source path, a target path, and a redirect type. 

18. (Original) The article of manufacture of claim 13, wherein the operations further 
comprise: 

detecting duplicate documents in two different equivalence classes; and 
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merging the equivalence classes. 

19. (Original) The article of manufacture of claim 18, wherein documents are 
duplicates if a certain portion of their content is similar. 

20. (Original) The article of manufacture of claim 13, wherein the documents in the at 
least one equivalence class include a target document and one or more source documents and 
wherein the selected representative is one of the source documents, wherein the operations 
further comprise: 

propagating the content of the target document to the selected representative. 

21. (Original) The article of manufacture of claim 13, wherein the documents in the at 
least one equivalence class include a target document and one or more source documents, and 
wherein at least one source document includes a path to the target document. 

22. (Original) The article of manufacture of claim 2 1 , wherein the operations further 
comprise: 

indexing the content of the target document with a path of the representative. 

23. (Original) The article of manufacture of claim 13, wherein the operations for 
marking documents so that they are not indexed include operations for marking documents to 
indicate the documents are to be ignored. 

24. (Original) The article of manufacture of claim 13, wherein the operations further 
comprise: 

determining a rank for each of the documents, wherein the rank represents an importance 
of each document relative to the other documents. 

25 . (Currently Amended) A computer system including logic for handling redirects in 
documents, comprising: 

while generating an index. 
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forming at least one equivalence class that includes documents that are connected through 
a redirec t, wherein each equivalence class describes a redirect chain ; 

detecting cycles for each equivalence class, wherein documents in a cycle are marked so 
that they are not indexed , and, wherein, for each equivalence class, the cycle is formed when a 
last document in the redirect chain redirects to a first document in the redirect chain ; 

detecting incomplete chains for each equivalence class, wherein documents in an 
incomplete chain are marked so that they are not indexed , and, wherein, for each equivalence 
class, the indirect chain is formed when a last document in the redirect chain redirects to a 
document that has not been crawled ; and 

selecting a representative for each equivalence class whose documents are to be indexed, 
wherein the representative is associated with a path that indicates a location of a document in a 
data store. 

26. (Original) The computer system of claim 25, wherein the representative is 
selected based on a type of redirect in an equivalence class. 

27. (Original) The computer system of claim 25, wherein the representative is 
selected based on a rank of each document in the equivalence class. 

28. (Original) The computer system of claim 25, wherein the logic further comprises: 
locating each document that contains a redirect; and 

creating an entry in a redirect file for each document. 

29. (Original) The computer system of claim 28, wherein the entry includes a source 
path, a target path, and a redirect type. 

30. (Original) The computer system of claim 25, wherein the logic further comprises: 
detecting duplicate documents in two different equivalence classes; and 

merging the equivalence classes. 
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3 1 . (Original) The computer system of claim 30, wherein documents are duplicates if 
a certain portion of their content is similar. 

32. (Original) The computer system of claim 3 1 , wherein the documents in the at least 
one equivalence class include a target document and one or more source documents and wherein 
the selected representative is one of the source documents, wherein the logic further comprises: 

propagating the content of the target document to the selected representative. 

33. (Original) The computer system of claim 25, wherein the documents in the at least 
one equivalence class include a target document and one or more source documents, and wherein 
at least one source document includes a path to the target document. 

34. (Original) The computer system of claim 33, wherein the logic further comprises: 
indexing the content of the target document with a path of the representative. 

35. (Original) The computer system of claim 25, wherein marking documents so that 
they are not indexed includes marking documents to indicate the documents are to be ignored. 

36. (Original) The computer system of claim 25, wherein the logic further comprises: 
determining a rank for each of the documents, wherein the rank represents an importance 

of each document relative to the other documents. 
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